
PHOSPHOINOSITIDE 3-KINASES 

J^^^ BACKGROUND OF THE INVENTION 

Phosphoinositide 3-kinases (PI3Ks) are ubiquitous lipid kinases playing key 
5 roles both as signal transducers downstream of cell-surface receptors and in 

constitutive intracellular membrane and protein trafficking pathways. All PDKs are 
dual specificity enzymes with a lipid kinase activity capable of phosphorylating 
phosphoinositides at the 3-hydroxyl and with a protein kinase activity. The products 
of PI3K-catalysed reactions, phosphatidylinositol 3,4,5-trisphosphate 

10 (PtdIns(3,4,5)P 3 ), PtdIns(3,4)P 2 and PtdIns(3)P act as second messengers for a variety 
of signal transduction pathways, including those essential to cell proliferation, 
adhesion, survival, cytoskeletal rearrangement and vesicle trafficking (1,2). 

The mammalian PI3Ks can be divided into three classes based on their 
structure and substrate specificity (2). The class I PDKs are receptor-regulated 

15 heterodimeric enzymes that preferentially phosphorylate PtdIns(4,5)P2 in vivo. The 
class IA PI3Ks (consisting of pi 10a, pi 10J3, or pi 105 catalytic subunits) associate 
with an 85 kDa adaptor that is essential for interaction of these PDKs with receptor 
tyrosine kinases. The class IB PDK (PDKy) is activated by heterotrimeric G protein 
subunits and associates with a plOl adaptor that is important for full responsiveness - 

20 to G(3y heterodimers (3,4). Class I PDKs are also activated by Ras. Class II PDKs are 
distinguished by a C-terminal C2 domain and preferentially use Ptdlns and PtdIns(4)P 
as substrates. Class EQ enzymes phosphorylate only Ptdlns and lack the Ras-binding 
domain. 



25 BRIEF DESCRIPTION OF THE DRAWING 

Fig. 1 shows the overall structure of PI3KC (A) A ribbon diagram of PDK 
(prepared with MOLSCRIPT) showing the four domains: the RBD, the C2 domain, 
the helical domain and the catalytic domain with N-lobe and C-lobe. The N-terminal 
region preceding the RBD and the ordered portion between the RBD and C2 domain 
30 are white. (B) The solvent-accessible surface of the enzyme in the same orientation 
(prepared with GRASP (29)). (C) A block diagram showing the domain organization 
of the PDK classes. 
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Fig. 2 shows a schematic representation of the catalytic domain of PI3K. (A) A 
ribbon diagram of the PI3K catalytic domain with bound ATP. The two disordered 
residues in the middle of the activation loop are represented by dotted lines. (B) The 
active conformation of the Src family protein kinase Lck (30) (PDB entry 31ck). (C) 
5 A stereo representation of PI3K active site with bound ATP and two Lu 3+ ions 
(labelled Me I and Me II). 

Fig. 3 show the complete amino acid sequence of a porcine PI3Ky . The ( ) 

indicate gaps when the PI3Ky is aligned with other members of the PI3K family. The 
other PI3Ks are not shown, but are incorporated by reference to Walker et al., Nature, 
10 402:313-320,2000. 

Fig 4 is a model of phospholipid headgroup interactions with PI3K. (A) Two 
views of the solvent-accessible surface of the enzyme. The activation loop is coloured 
black. An inositol 1,4,5-trisphosphate (InsP 3 ) molecule (white ball-and-stick) has 
been modelled in the active site with the 3-OH near the y-phosphate of the bound 
15 ATP. (B) The same two views of the enzyme in ribbon representation with the 

activation loop and L1SP3 cyan. The right portion of the panel has been expanded to 
illustrate some features of the putative headgroup interaction. 

Fig. 5 shows a model of the Ras/PI3K interaction based on the structure of the 
RalGDS/Ras complex. The inset shows an overall view of Ras/PI3K interaction. 
20 Residues in Switch I and Switch II regions of Ras that influence effector binding are 
highlighted with stripes, while residues in the RBD of PI3K that are likely to be 
involved in Ras binding are shown as black stripes. The proximity of the RBD to the 
two lobes of the catalytic domain is also illustrated. 

Fig. 6 is a ribbon diagram of the PDK7C2 domain and the interactions it 
25 makes with the remainder of the enzyme. The elements of the helical and catalytic 

domains interacting with the C2 domain are shown. The inset shows the area selected 
for the detailed illustration. 

Fig. 7 is a schematic illustration of the helical domain. The A/B anti-parallel 
helical pairs characteristic of the HEAT motif topology consist of hAl/hBl, hA2/hB2, 
30 hA3/hB3, hA4/hB4 and hA5/hB5. The left half of the panel illustrates the interaction 
that the helical domain makes with the RBD and C2 domain (the remainder of the 
protein was removed for clarity). This interaction involves principally the A-helix 
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surfaces. The interactions between the helical domain and the catalytic domain are 
shown on the right. 

Fig. 8 shows the complete amino acid sequence of porcine PBKy. 

DESCRIPTION OF THE INVENTION 

The present invention relates to phosphoinositide 3-kinases (PBKs), a class of 
enzymes involved in signal transduction and in constitutive intracellular membrane 
protein trafficking pathways. PBKs possess dual catalytic functions, possessing both 
lipid kinase and protein kinase activity. The products of PBK-catalyzed reactions are 
second messengers in a variety of signal transduction pathways, including those 
involved in cell proliferation, adhesion, survival, cytoskeletal rearrangement, and 
vesicle trafficking. Thus, modulating its activities is useful for regulating cellular 
activities, e.g., involved in inflammation, repair, healing, development, and 
differentiation (e.g., for regulating stem cell growth and differentiation). 

In accordance with the present invention, the three dimensional structure of a 
PBKy has been determined. The present invention thus relates to a PBKy crystal with 
unit dimensions of about a=143.3 A, b=67.6 A, c=107.0 A, and p=95.9°. The 
crystals have C2 symmetry and contain one molecule in the unit cell. Crystals can be 
grown and analyzed by any effective methods, such as methods described in the 
examples below. 

The present invention also relates to PBKy polypeptide muteins, polypeptide 
fragments, antibodies thereto, nucleic acids coding for these polypeptides, methods of 
modifying PBKy activity, and methods of modulating PBKy activity. These include 
polypeptides and methods thereof, relating to, e.g., phospholipid binding, lipid kinase 
activity, modulating Ras activity in activating the PBKy, binding of PBKy to cell 
membranes, and modulating protein-protein interactions with PBKy. Polypeptides, 
nucleic acids, and antibodies can be prepared according to any effective method. 

By the term "mutein," it is meant any non-naturally occurring mutation. 
Mutations can be introduced by any suitable method, e.g. by site-directed 
mutagenesis, by routinely by modifying or mutating a nucleotide sequence coding for 
an amino acid sequence of Fig. 3, and selecting for those mutations that affect one or 
more of its activities, e.g., by measuring activity as described in Bondeva et al., 
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Science, 282:293-296, 1998. Muteins can comprise amino acid substitutions, 
insertions, and deletions, including replacing naturally-occurring amino acids with 
non-naturally occurring amino acids. Amino acid substitution can be made by 
replacing one homologous amino acid for another. Homologous amino acids can be 
5 defined based on the size of the side chain and degree of polarization, including, small 
nonpolar: cysteine, proline, alanine, threonine; small polar: serine, glycine, aspartate, 
asparagine; large polar: glutamate, glutamine, lysine, arginine; intermediate polarity: 
tyrosine, histidine, tryptophan; large nonpolar: phenylalanine, methionine, leucine, 
isoleucine, valine. Homologous acids can also be grouped as follows: uncharged 

10 polar R groups, glycine, serine, threonine, cysteine, tyrosine, asparagine, glutamine; 
acidic amino acids (negatively charged), aspartic acid and glutamic acid; basic amino 
acids (positively charged), lysine, arginine, histidine. Homologous amino acids also 
include those described by Dayhoff in the Atlas of Protein Sequence and Structure 5, 
1978, and by Argos in EMBO J., 8, 779-785, 1989. Homologous amino acid 

15 replacement or modification can be utilized when it is desired to maintain, or enhance 
a PI3Ky activity. Non-homologous amino acid replacement or modification can be 
utilized when it is desired to destroy or decrease a PDKy activity. A polypeptide 
mutein, and its corresponding nucleotide coding sequence, can have an amino acid 
sequence as set forth in Fig. 3 except where one or more positions are substituted by 

20 homologous amino acids, e.g., where there are 1,5, 10, 15, or 20 substitutions. 

Amino acid substitutions can also be made based on analogy to related other PI3Ks. 

A PDKy of the present invention, fragments, and muteins thereof, can also 
comprise various modifications, where such modifications include lipid modification, 
methylation, phosphorylation, glycosylation, covalent modifications (e.g., of a side 

25 chain of an amino acid). Modifications to the polypeptide can be accomplished 
according to various methods, including recombinant, synthetic, chemical, etc. 

The present invention also relates to antibodies which are "specific-for" a 
particular polypeptide comprising a defined amino acid sequence of a PI3Ky. The 
phrase "specific-for" indicates that the antibody is selective for the defined amino acid 

30 sequence. The amino acids sequences can possess other immunogenic activities, as 
well, e.g., stimulating of T-cells, macrophages, B-cells, dendritic cells, etc. These 
responses can be measured routinely. 
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An aspect of the present invention relates to polypeptides, fragments and 
muteins, of PDKs that possess phospholipid binding, and PI3Ks which display 
modified phospholipid binding activity. As mentioned, PI3Ks phosphorylate 
phosphoinositides, and analogs and derivatives thereof, at a 3-hydroxyl group. The 
5 catalytic reaction involves binding of a phospholipid substrate to the enzyme. A 
polypeptide fragment of PI3Ky has been identified which possesses phospholipid 
activity. This fragment can also be referred to a phospholipid binding domain to 
indicate its primary activity. By the term "fragment," it is meant any sequence of 
amino acids which is less than the full-length size of a PI3K. A PI3K phospholipid 
10 binding domain, preferably consists essentially of a C-terminal helix kal2, catalytic 
loop, activation loop, and amino acid residues Lys807, Lys808, Arg947, and Lys973. 
The catalytic loop preferably consists essentially of amino acids 943-951 and the 
activation loop preferably consists essentially of amino acids 964-988, and kal2 
preferably consists essentially of amino acids 1081-1090. 

15 A PI3K mutein, or polypeptide fragment thereof, which possesses 

phospholipid binding activity preferably comprises Lys807, Lys808, Arg947, and 
Lys973. Muteins which possess less than normal phospholipid binding activity 
preferably comprise amino acid substitutions at one or more positions Lys807, 
Lys808, Arg947, and Lys973. Phospholipid binding to PI3K can be measured 

20 conventionally, e.g., using radiolabeled phospholipids. 

By the phrase "less than normal binding activity," it is meant that such mutein 
(full-length PI3K or a fragment thereof), displays an amount of activity which is 
reduced when compared to the wild-type, not mutated, enzyme. Such amount can be 
reduced by any quantity, e.g., 5%, 10%, 25%, 50%, or even a total loss of activity. 

25 Phospholipid binding activity can be measured by any effective method. 

An isolated polypeptide mutein of PI3K can comprise a phospholipid binding 
domain, which domain comprises the C-terminal helix keel 2, catalytic loop, and 
activation loop sequences of Fig. 3, and at least 95% sequence identity to the 
remaining sequence in Fig. 3. In general, the phrase that a domain, region, etc., 

30 comprises a sequence of Fig. 3, it meant that the polypeptide has 100% sequence 

identity to the sequence disclosed in Fig. 3. For this mutein, the phospholipid domain 
has 100% sequence identity to its sequence in Fig. 3, but the remaining regions have 
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less 100% sequence identity, such as 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 
more, but less than 100%. Such muteins can lack all known catalytic activities of 
PI3K, but still possess the phospholipid binding activity. 

The present invention also relates to methods of modulating phospholipid 
5 binding activity, e.g., binding of a phospholipid substrate to a PI3K enzyme, fragment, 
mutein, etc. Modulating can be accomplished in any manner, e.g., modifying the 
amino acid sequence of PI3K, contacting an active site or region with a modifying 
agent, e.g., a chemical agent which modifies the chemical groups, contacting an active 
site or region with a ligand, e.g., an antibody. In particular, as mentioned above, 

10 regions involved in phospholipid binding, comprise a C-terminal helix kal2, catalytic 
loop, and activation loop. The catalytic loop preferably consists essentially of amino 
acids 943-951 and the activation loop preferably consists essentially of amino acids 
964-988. Modification of these regions, particularly amino acids Lys807, Lys808, 
Arg947, and Lys973, can affect substrate binding activity. 

15 Another aspect of the present invention relates methods of modulating lipid 

kinase catalysis. Inhibition can be accomplished by various methods, including, e.g., 
modifying the amino acid sequence of a PI3K, contacting an active site, or amino acid 
residue thereof, with a modifying agent, etc. For example, the histidine at amino acid 
968 has been identified as involved in the deprotonation of the 3-hydroxy of the lipid 

20 headgroup. Alteration of the histidine would be expected to effect the enzyme's 

activity, e.g., by inhibiting, blocking, decreasing, reducing, enhancing, increasing, etc., 
its activity. Non-conservative (non-homologous) amino acid substitution can be 
expected to reduce catalytic activity. Conservative (homologous) amino acid 
substitution can be expected to not affect, or to increase catalytic activity. 

25 Another way of modulating the lipid kinase activity is to modify the amino 

acid sequence of a produced PI3K, e.g., at its active site, such as the amino acid 
residues surrounding and including His968. Any agent which can chemically modify 
an amino acid can be used, including, oxidizing agents, reducing agents, alkylating 
agents, etc. In addition, ligands which attach to the active site can be used, such as 

30 substrate analogs, antibodies, e.g., an antibody which recognizes an amino acid 
sequence comprising His968. 
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The present invention relates to polypeptides, muteins, etc., which comprise 
His968. Such polypeptide fragments can be useful to prepare antibodies, to inhibit 
lipid kinase activity by competing for substrates, etc. Useful polypeptides include, 
e.g., polypeptides which consist of about 500, 200, 100, 50, 30, 20, 10, 8, etc., the 
5 activation loop (e.g., amino acids 964-988, 950-988), etc. These polypeptides can be 
effective in eliciting an immune response to amino acid His968, and flanking regions 
thereof. 

A PI3Ky polypeptide mutein, comprising a sequence having at least 95% 
amino acid sequence identity to Fig. 3, and having a His968. Such a mutein can lack 

10 any of the mentioned activities of PI3K, but still retain its lipid kinase activity. 

The present invention also relates to a method of modulating Ras activity in 
activating the PDKy. Ras binding to PDKy through its Ras binding domain ("RBD") 
leads to enzyme activation. The regions involved in the Ras interaction with PI3Ky 
have been identified to include, e.g., a) specific regions of the N-terminal lobe of the 

15 catalytic region, such as kpl-k(52, kp4-k{35; ka6; b) RBD regions, such as Ra2 and 
Rp3 -Rp4; and c) RBD residues, such as Lys234, Asp238, and Lys255. Modification 
of any these regions or specific residues can be effective in modulating Ras activity, 
e.g., inhibiting, blocking, decreasing, reducing, enhancing, increasing, etc., its activity. 
By the phrase, "Ras modulatory activity," it is meant any activity in which PI3K 

20 affects Ras, including binding to Ras, activating Ras, etc. Ras modulatory activity can 
be measured conventionally, e.g., as described in Bondeva et al., Science, 282:293- 
296, 1998. 

The present invention also relates to PI3K polypeptides and muteins thereof 
which relate to the Ras modulatory activity. For instance, polypeptides involved in 

25 Ras modulatory and/or binding activity to PI3K, includes, e.g., k[5l-kp2 (782-794), 
kp4-kp5 (816-825); koc6 (921-942); RBD regions, such as Roc2 (291-300) and R|33 - 
R[34 (276-278, and fragments which comprise residues, such as Lys234, Asp238, and 
Lys255, e.g., Useful polypeptides include, e.g., polypeptides comprising, consisting 
of, consisting essentially, 782-816, 782-906, 795-816, 809-906, 817-906, and various 

30 combinations of any of the mentioned regions. A PI3K mutein can comprise 100% 
amino acid sequence identity with Fig. 3 at one or more of the above-mentioned 
domains, and less than 100% identity, such as 85%, 90%, 95%, 99%, or more, at any 
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other regions of the PI3K. Since such mutein comprises the sequences that are used to 
interact with Ras, the mutein would possess Ras modulatory activity, but can lack 
other activities. A preferred Ras binding domain polypeptide consists essentially 
amino acid residues 220-31 1, having Lys234, Asp238, and Lys255, but less than 
5 100% amino acid sequence identity at other positions, e.g., 70%, 75%, 80%, 85%, 
90%, 95%, 99%, or more, at any other regions of the PI3K. 

Antibodies, or other specific ligands, can be used to block Ras binding to 
PI3K, or the activation that results from such interaction. Regions a) and b) are 
involved in the intramolecular interaction of the N-terminal catalytic lobe with the 

10 RBD. Antibodies to any of such regions can block Ras interaction. Residues of RBD, 
such as Lys234, Asp238, and Lys255, form bonds with Ras (See, Table). Useful 
antibodies which block the Ras binding to PI3K, preferably include antibodies which 
are specific for a peptide comprising amino acids Lys234, Asp238, and, Lys255. In 
addition, antibodies to any of the above-mentioned regions which are involved with 

15 Ras modulatory can also be used. 

The present invention also relates to methods of inhibiting the binding of 
PI3Ky to cell membranes, comprising, e.g., modifying an amino acid a) the lining the 
crevice region between the N- and C-lobes (about residues 844-950, especially 
residues 844, 847, 947, 948, and 950 which form part of the phospholipid head-group 
20 pocket); b) the CBR regions (about residues 371-380, 401-407 and 434-459); or c) 
the region comprising the activation loop (about residues 964-989, especially 967). 
As already mentioned, "modifying" can mean replacing or chemically modifying 
amino acids in the mentioned domains, or contacting with ligands, such as antibodies, 
which recognize specifically the mentioned domains. 

25 Additional surface residues of PI3K are, e.g., about 755-756, 807-808, 994- 

905. and 1077-1084. Thus, inhibiting or modifying any of these residues, as 
discussed above and below, are useful to prevent binding of PI3K to cell membranes. 

Polypeptide fragments or muteins of PI3K which possess cell membrane 
binding activity can be used as modulators of the cell-membrane binding activity. For 
30 instance polypeptide fragments coding for regions a), b), and/or c), or parts thereof, 
can be administered in vivo or in vitro as antagonists to prevent an endogenous 
enzyme from targeting to cell membranes. Antibodies specific-for these regions can 
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be used in the same way. A preferred polypeptide mutein has about 100% sequence 
identity with regions a), b), and/or c), and less than 100%, e.g., 99%, 95%, 90%, 85%, 
80%, 70%, or more, sequence identity with the remaining regions of a PI3K shown in 
Fig. 3. 

5 PI3K has a helical domain consisting of five A/B pairs of anti-parallel helices. 

Much of the B -surface is solvent exposed, providing a surface for interaction with 
other proteins, such as the plOl adaptor or G$y subunits. The present invention thus 
relates to modulating protein-protein interactions with PDKy, comprising modifying 
the surfaces of the B-helices. The B-surfaces comprise, the exposed parts of B- 

10 helices, hBl (570-577), hBl' (579-586), hB2 (601-613), hB2' (615-619), hB3 (638- 
650) hB4 (676-686), or hB5 (710-722). The modifying can comprise contacting said 
amino acid with an antibody specific-for hBl, hBl \ hB2, hB2', hB3, hB4, or hB5, or 
by replacing, substituting, deleting, modifying, etc., an amino acid at such regions. A 
preferred polypeptide mutein or fragment (e.g., consisting essentially of 570-722) of a 

15 PI3Ky comprises 100% sequence identity to hBl-hB5 of Fig. 3, and less than 100% 
sequence identity to the remaining sequence in Fig. 3, e.g., 50%, 60%, 70%, 75%, 
80%, 85%, 90%, 95%, 99%, etc. Such mutein would retain the ability to interact with 
proteins. 

Nucleic acids which code for any of the polypeptides, polypeptide fragments, 
20 and muteins thereof, can be prepared conventionally, using naturally-occurring or 
synthetic nucleotide sequences. See, e.g., Maniatis et al., Molecular Cloning, A 
Laboratory Mammal, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 
1989. 

Antibodies, e.g., polyclonal, monoclonal, recombinant, chimeric, humanized, 
25 single-chain, Fab, etc., can be prepared according to any desired method. See, e.g., 
screening recombinant immunoglobulin libraries (e.g., Orlandi et al., Proc. Natl. 
Acad. Sci., 86:3833-3837, 1989; Huse et al., Science, 256:1275-1281, 1989); in vitro 
stimulation of lymphocyte populations (Winter and Milstein, Nature, 349: 293-299, 
1991). For example, for the production of monoclonal antibodies, a polypeptide 
30 according to the present invention can be administered to mice, goats, or rabbits 
subcutaneously and/or intraperitoneally, with or without adjuvant, in an amount 
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effective to elicit an immune response. The antibodies can be IgM, IgG, subtypes, 
IgG2a, IgGl, etc. Antibodies, and immune responses, can also be generated by 
administering naked DNA. See, e.g., U.S. Pat. Nos. 5,703,055; 5,589,466; 5,580,859. 
Polypeptides for use in the induction of antibodies do not need to have 
5 biological activity; however, they must have immunogenic activity, either alone or in 
combination with a carrier. Peptides for use in the induction of specific-for antibodies 
may have an amino sequence consisting of at least five amino acids, preferably at least 
10 amino acids. Short stretches of amino acids, e.g., five amino acids, can be fused 
with those of another protein such as keyhole limpet hemocyanin, or another useful 

10 carrier, and the chimeric molecule used for antibody production. Regions of PI3K 
useful in making antibodies are mentioned above and in the examples below. 

The polypeptides and antibodies of the present invention may be used with or 
without modification. Frequently, the polypeptides and antibodies will be labeled by 
joining them, either covalently or noncovalently, with a substance which provides for 

15 a detectable signal. A wide variety of labels and conjugation techniques are known 

and have been reported extensively in both the scientific and patent literature. Suitable 
labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent 
agents, chemiluminescent agents, magnetic particles and the like. Patents teaching the 
use of such labels include U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 

20 4,277,437; 4,275,149; and 4,366,241. 

Antibodies and other ligands which bind PI3K can be used in various ways, 
including as therapeutic, diagnostic, and commercial research tools, e.g., to quantitate 
the levels of Pi3k polypeptide in animals, tissues, cells, etc., to identify the cellular 
localization and/or distribution of it, to purify it, or a polypeptide comprising a part of 

25 it, to modulate the function of it, in Western blots, ELIZA, immunoprecipitation, RIA, 
etc. The present invention relates to such assays, compositions and kits for 
performing them, etc. 

In addition, ligands which bind to a PI3K according to the present invention, 
or a derivative thereof, can also be prepared, e.g., using synthetic peptide libraries or 

30 aptamers (e.g., Pitrung et al., U.S. Pat. No. 5,143,854; Geysen et al., J. Immunol. 
Methods, 102:259-274, 1987; Scott et al., Science, 249:386, 1990; Blackwell et al., 
Science, 250:1 104, 1990; Tuerk et al., 1990, Science, 249: 505.). 
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A polypeptide of the present invention can be combined with one or more 
structural domains, functional domains, detectable domains, antigenic domains, 
and/or a desired polypeptide of interest, in an arrangement which does not occur in 
nature, i.e., not naturally-occurring. A polypeptide comprising such features is a 
5 chimeric or fusion polypeptide. Such a chimeric polypeptide can be prepared 

according to various methods, including, chemical, synthetic, quasi-synthetic, and/or 
recombinant methods. A chimeric nucleic acid coding for a chimeric polypeptide can 
contain the various domains or desired polypeptides in a continuous (e.g., with 
multiple N-terminal domains to stabilize or enhance activity) or interrupted open 

10 reading frame, e.g., containing introns, splice sites, enhancers, etc. The chimeric 
nucleic acid can be produced according to various methods. See, e.g., U.S. Pat. No. 
5,439,819. A domain or desired polypeptide can possess any desired property, 
including, a biological function such as signaling, growth promoting, cellular 
targeting (e.g., signal sequence, targeting sequence, such as targeting to the 

15 endoplasmic reticulum or nucleus), etc., a structural function such as hydrophobic, 
hydrophilic, membrane-spanning, etc., receptor-ligand functions, and/or detectable 
functions, e.g., combined with enzyme, fluorescent polypeptide, green fluorescent 
protein, (Chalfie et al., Science, 263:802, 1994; Cheng et al., Nature Biotechnology, 
14:606, 1996; Levy et al., Nature Biotechnology, 14:610, 1996), etc. In addition, a 

20 polypeptide, or a part of it, can be used as a selectable marker when introduced into a 
host cell. For example, a nucleic acid coding for an amino acid sequence according to 
the present invention can be fused in-frame to a desired coding sequence and act as a 
tag for purification, selection, or marking purposes. The region of fusion can encode 
a cleavage site to facilitate expression, isolation, purification, etc. 

25 A polypeptide according to the present invention can be produced in an 

expression system, e.g., in vivo, in vitro, cell-free, recombinant, cell fusion, etc., 
according to the present invention. Modifications to the polypeptide imparted by such 
systems include glycosylation, amino acid substitution (e.g., by differing codon 
usage), polypeptide processing such as digestion, cleavage, endopeptidase or 

30 exopeptidase activity, attachment of chemical moieties, including lipids and 
phosphates, etc. 
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EXAMPLES 

We have determined the structure of the catalytic subunit (residues 144-1 102) 
of porcine PDKy. This construct contains all of the homology regions found in class I 
PI3Ks (HR1, HR2, HR3 and HR4) and has a catalytic activity similar to the full length 
5 enzyme. The N-terminal region absent in our construct of PI3Ky is important for 
interaction with the plOl adaptor (5), and the analogous region of PBKoc interacts 
with the p85 adaptor. The enzyme has a modular structure consisting of four 
domains: a Ras-binding domain (RBD), a C2 domain, a helical domain and a catalytic 
domain (Fig. 1). The RBD, C2 and catalytic domains have folds similar to these 
10 modules in other proteins involved in signal transduction. The helical domain has a 
fold akin to HEAT-repeat containing structures involved in protein-protein 
interactions. 

The catalytic domain of the enzyme consists of a smaller N-terminal lobe 
(residues 726-883) and a larger C-terminal lobe (884-1092). The portion of the N- 

15 terminal lobe from kp3 to ka3 and the first part of the C-terminal lobe (up to the end 
of k|3l0) have a fold similar to protein kinases (reviewed in (6)), and this similarity 
extends to many of the details of the ATP binding site (Fig. 2). This region is among 
the most conserved regions of the PI3Ks (Fig. 3). The structural similarity of PI3K to 
protein kinases is consistent with finding that PI3Ks have a protein kinase activity in 

20 addition to their lipid kinase activities (7,8). The sequence alignment in Fig. 3 
illustrates the regions of the enzyme that structurally superimpose with tyrosine 
protein kinase c-Src. The N-terminal lobe consists of a five-stranded antiparallel P~ 
sheet flanked on one side by a helical hairpin (kal-kot2) and a small two-stranded p- 
sheet (P1-P2) and on the other side by the koc3 helix and the C-terminal lobe. Strands 
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kp3-kP7 correspond to the five-stranded (3-sheet found in the protein kinases. The 
kp3-k(34 loop corresponds to the protein kinase Pl-p2 loop (also known as the 
glycine-rich or P-loop). This loop interacts closely with the phosphates of the bound 
ATP, but unlike the protein kinases, it contains no glycine. Instead, the side chain of 

5 Ser 806, a residue that is conserved in all PDKs, interacts with the p-phosphate (Fig. 
2). Residue Lys 833 at the end of k(35, corresponding to Lys 72 of c-AMP-dependent 
protein kinase, interacts with the a-phosphate of ATP. This residue is conserved in 
all PDKs and is covalently modified by Wortmannin (9). There are two metal binding 
sites (Fig. 2). Me I interacts with the conserved Asn 95 1 while Me II interacts with 

10 Asp 836 and Asp 964. 

The link between the N- and C-terminal lobes is via a loop between strands 
k(37 and k(38. This loop forms the deepest wall of the ATP binding pocket and 
provides two hydrophobic contacts with the adenine moiety of the ATP. The C- 
terminal lobe forms a portion of the ATP binding site as well as the binding site for 

15 phospholipid substrates. The region between ka6 and k|39 (residues 943-951) 

corresponds to the catalytic loop of the protein kinases. Mutations of residues in this 
loop analogous to Asp 946, Arg 947, Asp 950 and Asn 951 abolish kinase activity of 
PI3Ks (7,8). 

The C-terminal lobe contains a segment (964-988) analogous to the activation 
20 loop in the protein kinases; this loop is essential for the substrate specificity of the 

PDKs (10). In the ATP/Lu 3+ complex, much of this loop (968-982) is disordered. In 
the structure of an enzyme/chloramine T complex, all but two residues (Phe 975 and 
Leu 976) of this segment are visible, although high B-factors suggest that this loop is 
flexible. The activation loop is on the surface of the enzyme between the C-terminal 
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helix kocl2 on one side and kalO on the other. We have attempted to soak 
phospholipid analogues into PI3Ky crystals, but no substrate was evident in the 
electron density. Consequently, we have modelled phospholipid headgroup binding, 
but because conformational changes are likely to occur in the activation loop and 
possibly in the C-terminal helix upon substrate binding, the model is only 
approximate. In this model, the headgroup is positioned in a cavity lined by the C- 
terminal helix kal2, the activation loop and the catalytic loop (Fig. 4). This would 
place the 5-phosphate of a PtdIns(4,5)P2 adjacent to Lys 973 and the 1 -phosphate near 
Lys 807 and Lys 808. The involvement of Lys 973 as a ligand of the 5-phosphate 
might explain why this residue is not present in the class II PI3Ks which do not 
phosphorylate phosphoinositides with a 5-phosphate. The basic residues nearest the 
4-phosphate are Arg 947 and Lys 973. The specificity of the class HI PI3Ks for 
phosphatidylinositol might be explained by their shorter activation loop that might not 
leave sufficient space to accommodate a 4-phosphate at the bottom of the headgroup- 
binding pocket. PI3K5 autophosphorylates in a region just beyond the C-terminal 
helix kal2 (11), resulting in enzyme inhibition probably by sterically preventing 
substrate binding. The proximity of the C-terminal segment to the substrate binding 
site is consistent with autophosphorylation of this region. 

The mechanism originally proposed for the enzymatic activity of protein 
kinases involved participation of a residue acting as a general base to deprotonate the 
hydroxyl of the substrate generating a nucleophile that would attack the y-phosphate 
of ATP. In cAMP-dependent protein kinase (cAPK), Asp 166 has been proposed to 
play the role of this general base. This residue corresponds to Asp 946 of the PI3Ky 
946-DRH-948 sequence that is conserved in all PI3Ks. However, in the structure of 

Attorney Docket: ONYX 1048 
ONYX 1 048/Sneciflcation/1 04R-1 JS 



-15- 



PI3Ky, Asp 946 is not positioned so that it could have the role of a general base 
catalyst. The constellation of residues in the active site in the presence of ATP/metal 
suggests that Asp 946 may simply have a structural role in maintaining the integrity of 
the ATP-binding pocket. Therefore, either the enzyme has no general base catalyst, in 
5 which case the mechanism could be primarily dissociative, involving a metaphosphate 
transition state (12) or a different residue assumes this role in the PI3Ks. One 
candidate for the role of a general base may be His 948. Although the side-chain of 
His 948 is not near the y-phosphate of ATP, a rotation around % 1 would place the side 
chain in a location such that it might interact with the 3-hydroxyl of the lipid 
10 headgroup. 

PI3Ks have been identified as one of the effectors for Ras proteins (reviewed in 
(13)). Binding of PI3K to Ras is affected by mutations in both switch I and switch II 
regions of Ras (residues 30-38 and 60-76, respectively) (14,15). These two regions 
are known to change conformation upon GTP binding and serve as binding sites for a 
15 diverse array of downstream effectors. However, mutations in these switch regions 
have been identified that differentially affect binding of various effectors. 

The structure reveals that the RBD of PI3Ky (residues 220-31 1) has the same 
fold as the RBD of Raf (16) and RalGDS (17), two other well-characterised effectors 
of Ras (Fig 5). The RBD of PI3K consists of a five-stranded mixed (3-sheet (Rfi 1- 
20 Rp5) flanked by two oc-helices (Ral and Ra2). Residues 228-230 (in the Rpl/R|32 
loop) and 257-265 (in the Ral/RP3 loop) are disordered. 

The crystal structure of Ras-related protein RaplA in complex with the RBD 
of protein kinase c-Raf (16) and the structure of Ras in complex with the RBD of 
RalGDS (17), suggests a structural basis for effector specificity. For both of these 
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complexes, the structure was determined with the isolated RBD, without the catalytic 
portions of the effector molecules. The PI3Ry structure shows how the RBD interacts 
with the remainder of the enzyme. The RBD of PDKy contacts the N-lobe and to a 
lesser degree the C-lobe of the catalytic domain. RBD residues in Roc2 and the 
5 R(33/Rf}4 loop interact with the catalytic domain, mainly with the k|Sl/k{32 and 

kp4/kp5 loops and helix kot6. The position of the RBD of PDKyin relationship to 
the remainder of the enzyme allows for two possible mechanisms by which Ras 
binding might cause effector activation. One possibility is a recruitment mechanism 
whereby Ras increases PI3K activity by translocating the enzyme to the plasma 

10 membrane. A second possibility would be an allosteric mechanism in which Ras 

binding to the RBD causes a conformational change that would be propagated through 
the RBD/catalytic domain interface to affect substrate or co-factor binding. 

By superimposing the RBDs of RalGDS and PI3Ky, it is possible to construct 
a model of Ras interaction with PI3Ky (Fig. 5). With this model, we can rationalise 

15 the differential effects of various switch I and switch II mutants on PI3K binding as 
opposed to other effectors. Mutations in Ras switch I residues T35S and D38E 
eliminate PI3K binding, but do not affect Raf binding (15). The E37G mutation 
abolishes binding to PI3K and Raf but not to RalGDS. The Y40C mutation does not 
affect PI3K binding, but abrogates Raf and RalGDS binding. In the switch II region, 

20 the Y64G mutation eliminates PI3K and neurofibromin binding but has no effect on 
Raf binding (14). In the model of the PI3K/Ras interaction, residues E37, D38, Y40 
and Y64 would be at the PI3K/Ras interface. PI3K K234 would be in a position to 
form a salt bridge to E37 of Ras and K255 at the C-terminal end of Ral could form a 
salt link with D38 of Ras. K255 in PDKyis probably analogous to K227 in PBKot. 
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Mutation K227E blocks PBKa binding to Ras (15). Y40 interacts with K32 in 
RalGDS (numbering as in (17)). However, because of a very different orientation of 
the K32 equivalent in PDKy (K234) this interaction may not be possible. This could 
account for the insensitivity of PI3K to the Y40C mutation. On the other hand, Y64 
in switch II would be in a position to form a hydrogen bond with PI3K D238, but this 
residue has no specific interaction with RalGDS. This may explain the sensitivity of 
PI3K to the Ras Y64G mutation. 

The PDKy C2 domain (residues 357-522) is an eight-stranded antiparallel 0- 
sandwich consisting of two four-stranded P-sheets (Fig. 6). The fold of this domain is 

the same as the type II C2 domain found in PLC51 The N-terminal regions of all 
three PI3K classes have C2 domains, while the class II enzymes have an additional 
C2 domain at the C-terminus (Fig. 1). The segments leading from the RBD into the 
C2 domain and from the C2 domain to the helical domain are not ordered. 

C2 domains are often involved in Ca 2+ -dependent or Ca 2+ -independent 
phospholipid membrane binding using three loops known as CBRs located at one end 
of the domain. The CBRs for PDKy are the loops connecting pi with p2 (CBR1), p3 
with p4 (CBR2), and P5 with p6 (CBR3). The CBR3 of PDKy is quite long compared 
to other C2 domains and is disordered in our structure. The C2 domain interacts 
primarily with the helical domain, but it also interacts with the linker segment before 
the RBD and with the C-terminal lobe of the catalytic domain. The surface of the C2 
domain contacting the rest of PDKy is nearly identical to the surface of the PLC51 C2 
domain that contacts the catalytic domain of PLCS L 

PI3K can bind phospholipid membranes in the absence of other protein 
components, in a Ca 2+ -independent manner and carry out processive catalysis at the 
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membrane surface. By analogy with other enzymes such as protein kinase C and 
cytosolic phospholipase A2, it may be that the C2 domain of PI3K participates in 
membrane interaction. Consistent with this, we have found that the isolated C2 
domain from PI3Ky binds multilamellar phospholipid vesicles similarly to the full- 
5 length enzyme (data not shown). In PI3K(3 and PI3K5, CBR3 (residues 395-417 of 
PI3K8) is particularly rich in basic residues that may be important for membrane 
binding. 

The structure of a type 11(3 phosphatidylinositol phosphate kinase (PIPK) was 
recently reported (19). This dimeric enzyme, which phosphorylates phosphoinisitides 

10 at the 4-hydroxyl, consists of a single, catalytic domain. The dimer has an extensive 
flat, positively-charged surface that was proposed to be the membrane-binding 
interface of the enzyme. Although the N-lobe of PDPK is structurally related to the 
catalytic domain of PI3Ky, the location of the PI3Ky C2 domain with respect to the 
catalytic domain would sterically preclude membrane interactions using the surface of 

15 PI3Ky analogous to the putative PIPK membrane-binding surface. Given the location 
of the membrane-binding loops from the C2 domain and the cavity in the catalytic 
domain that must accommodate the PtdIns(4,5)P 2 headgroup, the membrane-binding 
surface of PI3Ky would be such that the CBRs, the crevice between the N- and C- 
lobes of the catalytic domain and the tip of the activation loop would face the 

20 membrane interface (Fig. 4A, right panel would represent a view from the membrane 
surface). 

The helical domain of PI3K (residues 545-725) consists of five A/B pairs of 
anti-parallel helices (Fig 7). The first two pairs have one kinked helix each, hBl/hBT 
and hB2/hB2'. This region has been variously referred to as HR2, the PI3K accessory 
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domain, and the PIK domain, but no clear function has been ascribed to this region. 
The paired arrangement of a series of helices connected into a right-handed super 
helix is reminiscent of the PR65/A regulatory subunit of protein phosphatase 2A 
(PP2A) (20). PR65/A is a member of a diverse group of proteins that contain between 
5 three and 25 tandem repeats of a short sequence that has been termed the HEAT 

motif. The HEAT motif consists of paired helices A and B arranged so that the A and 
B helices within a pair are anti-parallel and the A and B helices from one motif are 
parallel to the A and B helices of the next motif in the sequence. Although no HEAT 
sequence motif is apparent in the helical domain of PI3K, its structure is quite similar 

10 to that of PR65/A in terms of the arrangement of helices, the length of the A/B units, 
and the angle between the A/B pairs. 

The function of HEAT repeats is to form protein/protein interactions. In the 
case of importin-p, the interactions that the protein makes with the small GTPase Ran 
involve primarily the surfaces of the B helices (21). For PR65/A, mutagenesis has 

15 implicated the loops connecting the A/B pairs as the region responsible for interaction 
with PP2A (20). In PI3Ky, the helical domain is central to the inter-domain packing. 
The surface formed by the A helices interacts with the catalytic domain. The loops 
connecting A and B helices within a pair pack against the C2 domain while the loops 
between helical pairs pack against the RBD (Fig. 7). Much of the "B" surface is 

20 solvent-exposed and may interact with other proteins known to bind PI3Ky such as the 
plOl adaptor or Gpysubunits. 

The helical domain is common to both PI3K and PI4K families and serves as a 
spine on which the other domains are fastened. One of the proteins in which the 
HEAT sequence motif was first noted is the target of rapamycin, TOR, a yeast 
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homologue of human FRAP (reviewed in (22)). FRAP has a C-terminal domain with 
clear sequence homology to the catalytic domain of PI3Ks. The secondary structure 
prediction for the remainder of FRAP suggests that most of FRAP, apart from the 
catalytic domain, may consist of helical repeats folded into a right-handed superhelix 
5 as observed in the helical domain of PI3Ky. 

This first view of the structure of a PI3K provides a framework within which 
mutagenesis and detailed kinetic studies can be carried out to establish the enzymatic 
mechanism and the mode of activation by Ras and heterotrimeric G protein subunits. 

10 Methods 

Protein expression, purification and crystallisation. The cell-free extract of 
baculovirus-infected Sf9 cells expressing the His-tagged catalytic subunit of porcine 
PI3Ky (residues 1-143 deleted) was used for protein purification using Talon resin, 
followed by thrombin cleavage, anion and cation exchange, and gel filtration 

15 chromatography. Crystals were grown by mixing l(il of PI3K (at 3.5 to 4.0 mg/ml, in 
a buffer containing 20 mM Tris-HCl pH7.2, 1% v/v ethylene glycol, 1% w/v betaine, 
0.02% w/v CHAPS and 5 mM DTT) with l|il of a reservoir solution containing 150- 
200 mM U2SO4, 100 mM Tris-HCl pH 7.25 and 14-15% PEG 4000. 
Data collection and structure determination. Crystals have C2 symmetry with unit- 

20 cell dimensions of a=143.3 A, b=67.6 A, c=107.0 A, (3=95.9°, and contain one protein 
molecule in the asymmetric unit. Diffraction data were collected at ESRF beamlines 
ED2 and ID 14-4. Data were collected at 100K after freezing crystals in a 
cryoprotectant consisting of 150-200 mM U2SO4, 100 mM Tris-HCl pH 7.25, 12% 
glycerol and 20% PEG 4000. Data were processed using MOSFLM (23) and CCP4 
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programs (24). The structure was determined by multiple isomorphous replacement 
(MIR) methods. Heavy-atom positions were located using Solve (25) and refined 
with Sharp (26) (Table 1). A model was built into the electron density maps using the 
program O (27) and refined using CNS (28). The average B-factor for all atoms is 60 
A 2 . The structure has no residues in disallowed regions of the Ramachandran plot. 

The highest resolution data obtained were for the complex containing ATP and 
lutetium. Refinement of this complex resulted in a model with a free R-factor of 0.30 
to a resolution of 2.2 A. This complex has 854 residues visible in the electron density 
map. Comparison of crystals with and without ATP showed only minor differences in 
side-chain conformations in the active site residues. PDKs require a Mg 2+ or Mn 2+ 
cofactor for enzymatic activity. Complexes with Lu 3+ , Mg 2+ or Mn 2+ show that each 
of the metals binds at the same two sites. 
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The preceding description, utilize the present invention to its fullest extent. 
The preceding preferred specific embodiments are, therefore, to be construed as 
merely illustrative, and not limiting the remainder of the disclosure in any way 
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above and in the figures are hereby incorporated by reference in their entirety. Walker 
et al., Nature, 402:313-320, is incorporated by reference in its entirety, including Figs. 
1-7. 
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